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Specification 



MEDIA DELIVERING APPARATUS AND MEDIA RECEIVING APPARATUS 



Field of the Invention 
5 The present invention relates to a media delivering 

apparatus which delivers media data by way of a network, and 
a media receiving apparatus which receives media data delivered 
thereto . 

Background of the Invention 

10 In order to deliver two or more media data containing video 

data, audio data, etc. by way of a network and to present 
delivered media data faithfully according to a scenario written 
in XML (extensible markup language) or the like which is a 
description language, there is a necessity to bring the two or 

15 more media data into synchronization with one another. As a 
method of bringing two or more media data into synchronization 
with one another, JP, 2003-9120 , A discloses a delivery of two 
or more media data while dynamically carrying out frame skipping, 
image quality degradation or the like according to the 

20 conditions of the network. 

Fig. 1 is a diagram explaining a media delivery system 
disclosed in the above-mentioned publication before 
examination. Servers Sa, Sb, and Sc each of which delivers 
media data are connected with a terminal device c which presents 

25 media data by way of a very thin channel, a thin channel, and 
a thick channel of a network 1, respectively. For example, 
while media data are simultaneously delivered from the servers 
Sa, Sb, and Sc, the server Sa connected to the very thin channel 
delivers media data in which frame skipping or image quality 

30 degradation is carried out so that the terminal device c can 
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provide a synchronous presentation of the media data 
simultaneously delivered from the servers Sa, Sb, and Sc 
according to a scenario written in XML or the like. 

A problem with the related art media delivering apparatus 
5 constructed as mentioned above is that since the receiving 
capability of the media receiving apparatus is not taken into 
consideration at the time of creating the scenario written in 
XML, there are cases where the scenario written in XML 
description cannot be executed depending on the receiving 

10 capability of the media receiving apparatus, that is, there is 
a possibility that unpresentable media data are delivered to 
the media receiving apparatus. Furthermore, there is a 
possibility that depending on the communication capability of 
the network, it takes time for the media receiving apparatus 

15 to adapt to a synchronous presentation of the two or more media 
data delivered thereto, and therefore a presentation of the two 
or more media data cannot be normally provided by the media 
receiving apparatus in the meantime. Another problem is thus 
that the media delivering apparatus cannot generate a scenario 

20 written in XML according to conditions at the time of delivery 
of the scenario, and the media receiving apparatus cannot 
normally present the two or more media data. 

The present invention is made in order to solve the 
above-mentioned problems, and it is therefore an object of the 

25 present invention to provide a media delivering apparatus which 
generates metadata for delivering media data in consideration 
of the receiving capability of a media receiving apparatus and 
the communication capability of a network so that the media 
receiving apparatus can present the media data normally. 

30 It is another object of the present invention to provide 
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a media receiving apparatus which can present media data 
normally based on metadata delivered from a media delivering 
apparatus . 

5 Disclosure of the Invention 

In accordance with the present invention, there is 
provided a media delivering apparatus which delivers media data 
to a media receiving apparatus by way of a network, the apparatus 
including: a parameter acquiring unit for acquiring at least 

10 one of a communication capability of the above-mentioned 
network, and a receiving capability of the above-mentioned 
media receiving apparatus; a media selecting unit for selecting 
media data to be delivered based on both a degree of media 
importance assigned to each of the above-mentioned media data 

15 and at least the one of the above-mentioned communication 
capability of the network and the above-mentioned receiving 
capability of the media receiving apparatus; a 
transmission-data generating unit for generating metadata in 
which both address information indicating a location of the 

20 selected media data and presentation layout information 
indicating a presentation layout of the above-mentioned media 
receiving apparatus which is determined based on both the degree 
of media importance of the selected media data and at least the 
one of the above-mentioned communication capability of the 

25 network and the above-mentioned receiving capability of the 
media receiving apparatus are described; a data transmitting 
unit for delivering the above-mentioned metadata to the 
above-mentioned media receiving apparatus by way of the 
above-mentioned network; and a media communication unit for 

30 delivering the above-mentioned media data in response to a 
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request from the above-mentioned media receiving apparatus 
which has received the above-mentioned metadata. 

The present invention offers an advantage of being able 
to normally present media data and being able to deliver media 
5 data which are suited for each media receiving apparatus. 

Brief Description of the Figures 

Fig. 1 is a diagram explaining a related art media delivery 
system; 

10 Fig. 2 is a diagram explaining the fundamental structure 

of a media delivery system which uses metadata in accordance 
with embodiment 1 of the present invention; 

Fig. 3 is a block diagram showing the internal structure 
of the media delivering apparatus in accordance with embodiment 
15 1 of the present invention; 

Fig. 4 is a block diagram showing the internal structure 
of a media communication unit in the media delivering apparatus 
in accordance with embodiment 1 of the present inventions- 
Fig. 5 is a block diagram showing the internal structure 
20 of a media receiving apparatus in accordance with embodiment 
1 of the present invention; 

Fig. 6 is a diagram showing an example of a presentation 
layout in a case of simultaneously monitoring four points in 
accordance with embodiment 1 of the present invention; 
25 Fig. 7 is a diagram showing an example of the metadata 

in accordance with embodiment 1 of the present invention; 

Fig. 8 is a diagram explaining the media delivery system 
which uses the metadata in accordance with embodiment 1 of the 
present invention; 
30 Fig. 9 is a block diagram showing the internal structure 



«> • 

5 

of the media delivering apparatus in accordance with embodiment 
1 of the present invention; 

Fig. 10 is a diagram showing an example of a presentation 
layout in a case of simultaneously monitoring six points in 
5 accordance with embodiment 1 of the present invention; 

Fig. 11 is a diagram showing an example of the metadata 
in accordance with embodiment 1 of the present invention; 

Fig. 12 is a diagram showing an example of a presentation 
layout in a case of simultaneously monitoring four points in 
10 accordance with embodiment 1 of the present invention; 

Fig. 13 is a diagram showing an example of the metadata 
in accordance with embodiment 1 of the present invention; 

Fig. 14 is a diagram showing an example of a presentation 
layout in a case of simultaneously monitoring three points in 
15 accordance with embodiment 1 of the present invention; 

Fig. 15 is a diagram showing an example of the metadata 
in accordance with embodiment 1 of the present invention; 

Fig. 16 is a diagram showing an example of the metadata 
in which conditional branching for determining the presentation 
20 layout in accordance with embodiment 1 of the present invention 
is included; 

Fig. 17 is a diagram explaining another example of the 
media delivery systems which uses the metadata in accordance 
with embodiment 1 of the present invention; 
25 Fig. 18 is a block diagram showing the internal structure 

of the media delivering apparatus in accordance with embodiment 
1 of the present invention; 

Fig. 19 is a block diagram showing the internal structure 
of the media receiving apparatus in accordance with embodiment 
30 1 of the present invention; 
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Fig. 20 is a diagram showing an example of transition of 
the presentation layout when a degree of media importance is 
changed in a case of simultaneously monitoring six points in 
accordance with embodiment 1 of the present invention; 
5 Fig. 21 is a diagram showing an example of change in the 

metadata when the degree of media importance is changed in 
accordance with embodiment 1 of the present invention; 

Fig. 22 is a diagram showing an example of switching 
between presentation layouts when a degree of media importance 
10 is changed in a case of simultaneously monitoring three points 
in accordance with embodiment 1 of the present invention; 

Fig. 23 is a diagram showing an example of change in the 
metadata when the degree of media importance is changed in 
accordance with embodiment 1 of the present invention; 
15 Fig. 24 is a diagram showing an example of definition of 

the format of a change command, and an example of the metadata 
in accordance with embodiment 1 of the present invention; 

Fig. 25 is a diagram showing an example of the change 
command in accordance with embodiment 1 of the present 
20 invention; 

Fig. 26 is a diagram showing an example of the change 
command in accordance with embodiment 1 of the present 
invention; 

Fig. 27 is a diagram explaining a media delivery system 
25 which uses metadata in accordance with embodiment 2 of the 
present invention; 

Fig. 28 is a block diagram showing the internal structure 
of a media delivering apparatus in accordance with embodiment 
2 of the present invention; 
30 Fig. 29 is a diagram showing an example of the metadata 



in a case where a time-varying variation in a degree of media 
importance and a time when the degree of media importance is 
changed are known in advance in the media delivering apparatus 
in accordance with embodiment 2 of the present invention; 

Fig. 30 is a diagram showing an example of the metadata 
and a change command in a case where a time-varying variation 
in a degree of media importance and a time when the degree of 
media importance is changed are known in advance in the media 
delivering apparatus in accordance with embodiment 2 of the 
present invention; 

Fig. 31 is a diagram showing an example of a presentation 
layout and an example of the metadata in a case of simultaneously 
monitoring four points in the media delivery system in 
accordance with embodiment 2 of the present invention; and 

Fig. 32 is a diagram explaining another example of the 
media delivery system which uses metadata in accordance with 
embodiment 2 of the present invention. 

Preferred Embodiments of the Invention 

Hereafter, in order to explain this invention in greater 
detail, the preferred embodiments of the present invention will 
be described with reference to the accompanying drawings. 
Embodiment 1 . 

This embodiment 1 will be explained by taking, as an 
example, a media delivering apparatus and a media receiving 
apparatus which handle, as media data, surveillance video image 
data encoded with MPEG-4 Visual. The media data itself can be 
anything as long as it is general media data such as video data 
or audio data. The application of the present invention is not 
limited to a surveillance system. Furthermore, assuming that 
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metadata in which the features of corresponding media data are 
compactly described is written in SMIL (synchronized multimedia 
integration language) which is an XML-based multimedia 
presentation description language, and RTP ( real time transport 
protocol, IETF RFC3550) is used as a media delivery protocol 
and RTSP (real time streaming protocol, IETF RFC2326) is used 
as a media delivery control protocol, this embodiment 1 will 
be explained. However, this embodiment is not limited to this 
specific example from a workable standpoint. 

In accordance with this embodiment 1, the media 
delivering apparatus selects media data to be delivered based 
on the degree of media importance assigned to each media data 
and then delivers the selected media data to the media receiving 
apparatus so that the user can refer to the important media data 
on a priority basis using the media receiving apparatus. The 
media delivering apparatus may select two or more media data 
to be delivered depending on the degree of media importance 
assigned to each media data. The media delivering apparatus 
delivers the selected media data to the media receiving 
apparatus based on metadata which it dynamically creates 
according to the degree of media importance assigned to the 
media data so that the media receiving apparatus can receive 
the media data only by performing a synchronous presentation 
operation according to the metadata delivered thereto. 

Fig. 2 is a diagram explaining the fundamental structure 
of a media delivery system which uses metadata in accordance 
with embodiment 1 of the present invention. In Fig. 2, the media 
delivering apparatus 201 is connected to the media receiving 
apparatus 202 by way of a network 203 so that it can access the 
degree of media importance 204 assigned to any media data 205 
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and any media data 205. 

Next, the operation of the media delivery system shown 
in Fig. 2 will be explained. 

There is one or more media data 205 in the media delivery 
5 system, and a degree of media importance 204 is assigned to each 
of the one or more media data 205. Each media data 205 has an 
address which is uniquely discriminable within the system. 
When the media delivering apparatus 201 receives a request for 
delivery of media data 205 from the media receiving apparatus 

10 202, the media delivering apparatus 201 refers to the degree 
of media importance 204 assigned to each media data, selects 
one or more media data 205 to be delivered by setting the degree 
of media importance 204 referred as a parameter, creates 
metadata 206 in which address information indicating the 

15 location of each of the selected one or more media data 205 and 
presentation layout information indicating a layout of the one 
or more media data on the screen of the media receiving apparatus 
202 are described, and carries out a delivery 207 of the selected 
one or more metadata 206 to the media receiving apparatus 202. 

20 The media delivering apparatus 201 creates metadata 206 

by setting the degree of media importance 204 assigned to each 
of the selected one or more metadata as parameters. For example, 
the media delivering apparatus 201 selects media data 205 based 
on a threshold indicating "a medium degree of importance", and 

25 creates metadata 206. The media receiving apparatus 202 
receives the metadata 206, and issues a request 208 for delivery 
of the media data 205 based on the address information described 
in the metadata. When receiving the request 208, the media 
delivering apparatus 201 sends a request response 209 to the 

30 media receiving apparatus 202, and carries out a delivery 210 
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of the requested media data 205 to the media receiving apparatus. 
The media receiving apparatus 202 receives the media data 205 
delivered thereto, and presents it based on the presentation 
layout information described in the metadata 206. When 
receiving two or more media data 205 simultaneously, the media 
receiving apparatus 202 presents them while bringing the two 
or more media data 205 into synchronization with one another. 

Fig. 3 is a block diagram showing the internal structure 
of the media delivering apparatus 201. The media delivering 
apparatus 201 is provided with a media selecting unit 2706, a 
transmission-data generating unit 2707, a data transmitting 
unit 2708, and a media communication unit 2712, and can access 
the degree of media importance 204 assigned to any media data 
205 and any media data 205. 

Next, the operation of the media delivering apparatus 
shown in Fig. 3 will be explained. 

The media selecting unit 2706 of the media delivering 
apparatus 201 refers to the degree of media importance 204 
assigned to each media data, and selects media data 205 to be 
delivered by setting the degree of media importance 204 referred 
as a parameter. The transmission-data generating unit 2707 
creates metadata 206 in which both address information 
indicating a location of each media data 205 selected by the 
media selecting unit 2706, and presentation layout information 
indicating a layout of the selected media data on the screen 
of the media receiving apparatus 202, which is determined based 
on the degree of media importance 204 assigned to each selected 
media data 205 are described, and the data transmitting unit 
2708 carries out a delivery 207 of the metadata 206 which is 
created by the transmission-data generating unit to the media 
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receiving apparatus 202. 

The media receiving apparatus 202 receives the metadata 
206 delivered thereto, and issues a request 208 for delivery 
of the media data 205 based on the address information described 
in the metadata 206. The media communication unit 2712 of the 
media delivering apparatus 201 sends a request response 209 to 
the media receiving apparatus 202 in response to the request 
208, and carries out a delivery 210 of the requested media data 
205 to the media receiving apparatus. The media receiving 
apparatus 202 then receives the media data 205 delivered thereto, 
and provides a synchronous presentation of the media data based 
on the presentation layout information which is described in 
the metadata 206. 

Fig. 4 is a block diagram showing the internal structure 
of the media communication unit 2712 of the media delivering 
apparatus 201. The media communication unit 2712 is provided 
with an RTSP communication unit 2805 and an RTP transmitting 
unit 2808. 

Next, the operation of the media communication unit shown 
in Fig. 4 will be explained. 

In response to the request 208 for delivery of the media 
data 205, which the media receiving apparatus 202 has 
transmitted to the media delivering apparatus, the RTSP 
communication unit 2805 sends the request response 209 to the 
media receiving apparatus 202 according to RTSP, and instructs 
the RTP transmitting unit 2808 to deliver the requested media 
data 205 to the media receiving apparatus . The RTP transmitting 
unit 2808 then carries out a delivery 210 of the media data 205 
according to the command from the RTSP communication unit 2805. 

Fig. 5 is a block diagram showing the internal structure 
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of the media receiving apparatus 202. The media receiving 
apparatus 202 is provided with a data receiving unit 2909, a 
data analyzing unit 2910, a synchronous management unit 2911, 
an RTSP communication unit 2912, a media display unit 2913, and 
5 a media receiving unit 2914. 

Next, the operation of the media receiving apparatus 
shown in Fig. 5 will be explained. 

The data receiving unit 2909 of the media receiving 
apparatus 202 receives the metadata 206 delivered from the media 
10 delivering apparatus 201, and delivers it to the data analyzing 
unit 2910. Address information indicating the location of each 
media data 205, and presentation layout information indicating 
a layout of the media data on the screen are described in the 
metadata 206, and synchronization information indicating a 
15 timing for switching among screen displays when needed can be 
included in the metadata 206. The data analyzing unit 2910 
analyzes the metadata 206 received by the data receiving unit 
2909, notifies the address information indicating the location 
of each media data 205, which is described in the metadata 206, 
20 to the RTSP communication unit 2912, and, when the 
synchronization information is included in the metadata 206, 
creates a list of timings at which the media receiving apparatus 
issues a request for delivery of the media data 205, notifies 
it to the synchronous management unit 2911, and notifies the 
25 presentation layout information about the media data 205 which 
is described in the metadata 206 to the media display unit 2913. 
When the synchronization information is not included in the 
metadata 206, the data analyzing unit 2910 creates a list of 
request timings at which the media receiving apparatus issues 
30 a request for delivery of the media data at the time of start 
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of presentation, and notifies it to the synchronous management 
unit 2911. 

The synchronous management unit 2 911 provides a request 
timing to the RTSP communication unit 2912 based on the list 
of request timings notified from the data analyzing unit 2910. 
The RTSP communication unit 2912 issues a request 208 for 
delivery of the media data 205 at the request timing provided 
from the synchronous management unit 2911 based on the address 
information notified from the data analyzing unit 2910. 

When receiving this request 208 from the media receiving 
apparatus 202 , the media delivering apparatus 201 sends a 
request response 209 to the media receiving apparatus 202, and 
carries out a delivery 210 of the requested media data 205 to 
the media receiving apparatus. The media receiving unit 2914 
of the media receiving apparatus 202 receives the media data 
205 delivered thereto, and delivers the media data to the media 
display unit 2913. The media display unit 2913 receives the 
media data 205 from the media receiving unit 2914, and provides 
a synchronous presentation of the media data based on the 
presentation layout information notified from the data 
analyzing unit 2910. 

As a concrete example of the media delivery system shown 
in Fig. 2, a surveillance system will be explained. 

For example, in order to deliver media data 205 about each 
of four surveillance video images using four surveillance video 
cameras, the media delivering apparatus assigns a degree of 
media importance 204 to each media data in advance according 
to the place in which the corresponding surveillance camera is 
located. In this case, the media delivering apparatus 201 
selects media data 205 about a surveillance video image to be 
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delivered by setting the degree of media importance 204 assigned 
to each media data as a parameter, creates metadata 206 in which 
address information indicating the location of the selected 
media data 205 and presentation layout information are 
described, and carries out a delivery 207 of the created 
metadata 206 to the media receiving apparatus 202. 

Fig. 6 is a diagram showing an example of the presentation 
layout in a case of simultaneously monitoring four points (i.e., 
points A to D) . In a case of a list of degrees of importance 
706, a presentation layout in which a video image 702 of point 
A, a video image 703 of point B, a video image 704 of point C, 
and a video image 705 of point D are displayed is formed on the 
entire screen 701 of the media receiving apparatus 202. In 
contrast, in a case of a list of degrees of importance 708, a 
presentation layout in which only the video image 702 of point 
A is displayed is formed on the entire screen. 

The media delivering apparatus 201 refers to the degree 
of media importance 204 assigned to each media data, sets a 
certain degree of media importance 204 as a threshold, and 
selects media data 205 to be delivered dynamically. For example, 
when delivering media data 205 about one or more surveillance 
points whose degree of media importance 204 is equal to or 
greater than "medium degree of importance", the media 
delivering apparatus 201 dynamically creates metadata 206 which 
indicates a simultaneous delivery of all the media data 205 
about the video images 702 to 705 of points A to D in a case 
of the list of degrees of importance 706. On the other hand, 
the media delivering apparatus 201 dynamically creates metadata 
206 which indicates a delivery of only the media data 205 about 
the video image 702 of point A in a case of the list of degrees 
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of importance 708. Thus, based on the degree of media 
importance 204 which is assigned to each media data at the time 
when the user of the media receiving apparatus 202 makes a 
request of the media delivering apparatus 201 for delivery of 

5 media data 205, the media delivering apparatus 201 creates 
metadata 206 dynamically. 

Fig. 7 is a diagram showing an example of the metadata 
206, and this metadata 1301 is used for delivering media data 
205 about video images to be displayed on the entire screen 701 

10 of Fig. 6. In this case, although the metadata is written in 
a multimedia synchronous description language SMIL, the 
description language for use with the present invention is not 
limited to SMIL. In the metadata 1301 of Fig. 7, an smil tag 
1302 shows that up to a text closed by </smil> is SMIL data. 

15 A head tag 1303 shows that up to a text closed by </head> is 
a header portion. A layout tag 1304 shows that up to a text 
closed by </layout> is a description about the presentation 
layout information. A root-layout tag 1305 shows the size of 
the entire screen, and width shows the horizontal width of the 

20 screen and height shows the vertical width of the screen. A 
region tag 1306 is a tag which defines a region in the screen, 
id shows id for identification of the region, left shows an 
offset value in a horizontal direction, top shows an offset 
value in a vertical direction, width shows the horizontal width 

25 of the region, and height shows the vertical width of the region. 
A body tag 1307 shows that up to a text closed by </body> is 
a body portion. A par tag 1308 shows that a simultaneous 
presentation of media data specified by up to a text closed by 
</par> is to be provided. A video tag 1309 is a tag which 

30 specifies media data 205 to be displayed, region specifies id 
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of the region tag set by the presentation layout information 
and shows that the media data is to be presented in the region, 
and src shows the address information indicating the location 
of the media data 205. 

The media receiving apparatus 202 receives the metadata 
1301 of Fig. 7, and issues a request 208 for delivery of the 
media data 205 based on the address information described in 
the metadata. When receiving the request 208 from the media 
receiving apparatus, the media delivering apparatus 201 carries 
out a delivery 210 of the media data 205 to the media receiving 
apparatus 202. The media receiving apparatus 202 receives the 
media data 205 delivered thereto and carries out synchronous 
presentation of the media data so that they can have a 
presentation layout on the entire screen 701 as shown in Fig. 
6, which is described by the metadata 1301. 

Although Fig. 6 shows an example of monitoring four points, 
as an example of surveillance application, there can be provided 
an application example of, when monitoring two or more sections, 
for example, two or more buildings, the above-mentioned four 
points belonging to the same section, creating metadata 206 
dynamically based on the degree of media importance 204 assigned 
to each media data about the section when triggered by a request 
for delivery of video images captured for the section which is 
issued by the user of the media receiving apparatus 202, and 
delivering the metadata to the media receiving apparatus 202 
by means of the media delivering apparatus 201. In the case 
of such a surveillance application example, the media receiving 
apparatus 202 can allow the user to select a section for which 
it makes a request for delivery of video images, and the media 
delivering apparatus 201 creates metadata 206 based on the 
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degree of media importance 204 which is effective only in the 
section selected by the user. 

When the media delivering apparatus 201 creates metadata 
206 dynamically, the media receiving apparatus 202 can 
selectively present only media data 205 which is important at 
that time by only issuing a request 208 for delivery of the media 
data based on the metadata 206. In other words, the media 
receiving apparatus 202 simply presents the media data 205 
according to the metadata 206. According to this structure, 
if the media receiving apparatus 202 can interpret the metadata 
206 so as to issue a request 208 for delivery of the media data, 
the media receiving apparatus 202 can receive the media data 
205 at a correct timing even though the media receiving 
apparatus 202 is of whatever type. In addition, if the media 
receiving apparatus 202 has a decoder for decoding the media 
data 205, the media receiving apparatus 202 can provide a 
synchronous presentation of the media data 205 based on the 
metadata 206 even though the media receiving apparatus 202 is 
of whatever type . However, when there is a large amount of media 
data 205 to be delivered, there is a possibility that the media 
receiving apparatus 202 becomes overloaded depending upon the 
communication capability of the network 203 and the receiving 
capability of the media receiving apparatus 202, and 
presentation of media data is performed correctly. Next, the 
media delivery system which is so constructed as to prevent the 
media receiving apparatus from becoming overloaded will be 
explained. 

Fig. 8 is a diagram explaining a media delivery system 
which uses metadata in accordance with a variant of embodiment 
1 of the present invention. In Fig. 8, a media delivering 
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apparatus 301 is connected to a media receiving apparatus 202 
by way of a network 203, and can access the degree of media 
importance 204 assigned to any media data 205 and any media data 
205. 

Next, the operation of the media delivery system shown 
in Fig. 8 will be explained. 

There is one or more media data 205 in the system, and 
a degree of media importance 204 is assigned to each media data 
205. Each media data 205 has an address which is uniquely 
discriminable within the system. The number of various 
parameters for generating metadata 306 automatically in the 
media delivering apparatus 301 of Fig. 8 is large compared with 
that of the media delivering apparatus 201 of Fig. 2. To be 
more specific, when receiving a request for delivery of media 
data 205 from the media receiving apparatus 202, the media 
delivering apparatus 301 uses one or more of parameters shown 
in the following (b) to (d) in addition to a parameter shown 
in the following (a) : 

(a) The media importance 204 

(b) The communication capability 312 of the network 203 disposed 
between the media delivering apparatus 301 and the media 
receiving apparatus 202 

(c) The screen size as the receiving capability 313 of the media 
receiving apparatus 202 

(d) The decoding capacity, such as the number of decoders and 
the encoding rate, as the receiving capability 313 of the media 
receiving apparatus 202 

Thus, by using one or more of the parameters shown in (b) 
to (d) in addition to the parameter shown in (a) , and by taking 
into consideration not only the degree of media importance 204 
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but the communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 202, 
the media delivering apparatus selects one or more media data 
205 to be delivered, creates metadata 306 in which address 
5 information about the location of each selected media data 205 
and presentation layout information are described, and carries 
out a delivery 207 of the metadata 306 to the media receiving 
apparatus 202. The media receiving apparatus 202 receives the 
metadata 306, and issues a request 208 for delivery of the media 

10 data 205 based on the address information described in the 
metadata. The media delivering apparatus 301 receives the 
request 208, sends a request response 209 to the media receiving 
apparatus 202, and carries out a delivery 210 of the media data 
205 to the media receiving apparatus. The media receiving 

15 apparatus 202 receives the media data 205 delivered thereto, 
and provides a synchronous presentation of the media data 205 
based on the presentation layout information described in the 
metadata 306. 

In the media delivering apparatus 301 of Fig. 8, the 
20 selection of media data 205 to be delivered, the number of media 
data 205 to be delivered at a time, and the quality of the media 
data 205 to be delivered are based on the parameters shown in 
the above-mentioned (a) to (d) . To be more specif ic, the higher 
importance media data 205 has, the higher probability it is 
25 selected, and the higher communication capability 312 the 
network 203 has or the higher receiving capability 313 the media 
receiving apparatus 202 has, the larger number of media data 
205 is delivered at a time and the higher quality the media data 
205 has. 

30 Fig. 9 is a block diagram showing the internal structure 
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of the media delivering apparatus 301. This media delivering 
apparatus 301 differs from the media delivering apparatus 201 
of Fig. 3 in that it additionally has a parameter acquiring unit 
3007, and includes a media selecting unit 3006 instead of the 
5 media selecting unit 2706 and a transmission-data generating 
unit 3008 instead of the transmission-data generating unit 
2707. 

Next, the operation of the media delivering apparatus 
shown in Fig. 9 will be explained. 

10 The parameter acquiring unit 3007 acquires the parameters 

shown in the above-mentioned (b) to (d) , i.e., at least one of 
the communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 202 
using a certain means, and notifies the acquired parameter to 

15 the media selecting unit 3006. Although as this means for 
acquiring parameters, a communicating means for carrying out 
communications to exchange the communication capability or 
receiving capability with the media receiving apparatus 202 in 
advance, for example, can be provided, the means for acquiring 

20 parameters of the present invention is not limited to the 
communicating means. 

The media selecting unit 3006 determines media data 205 
to be delivered based on both the degree of media importance 
204 and at least one of the communication capability 312 of the 

25 network 203 and the receiving capability 313 of the media 
receiving apparatus 202. The transmission-data generating 
unit 3008 creates metadata 306 in which both address information 
indicating the location of each media data 205 selected by the 
media selecting unit 3006, and presentation layout information 

30 indicating a layout of the selected media data on the screen 
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of the media receiving apparatus 202, which is determined based 
on both the degree of media importance 204 assigned to the 
selected media data 205 , and at least one of the communication 
capability 312 of the network 203 and the receiving capability 

5 313 of the media receiving apparatus 202 are described. The 
other operation of the media delivering apparatus is the same 
as that of the media delivering apparatus 201 of Fig. 3. 

A surveillance system will be explained as a concrete 
example of the media delivery system shown in Fig. 8. An 

10 explanation will be given for a case in which the media 
delivering apparatus delivers media data to the media receiving 
apparatus 202 having a high receiving capability by way of the 
network 203 having a high communication capability by using, 
for example, all the parameters shown in the above-mentioned 

15 (a) to (d) in the case of Fig. 8. In this example, it is assumed 
that the network 203 has a communication band of 1.5Mbps, and 
the media receiving apparatus 202 has a resolution of XGA 
(1024x768) and a decoding capability of decoding media data 205 
about ten video images simultaneously. 

20 Fig. 10 is a diagram showing an example of the presentation 

layout in a case of simultaneously monitoring six points (e.g., 
points A to F) . Based on a list of degrees of importance 607, 
a presentation layout in which a video image 602 of point A, 
a video image 603 of point B, a video image 604 of point C, a 

25 video image 605 of point D, and a video image 606 of point E 
are displayed is formed on the entire screen 601 of the media 
receiving apparatus. As shown in the list of degrees of 
importance 607, in the presentation layout on the entire screen 
601, the video image of point E is a surveillance video image 

30 having a "high degree" of media importance, and the video images 
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of points A to D are surveillance video images having a "medium 
degree" of media importance. Since the media receiving 
apparatus 202 has a sufficiently large screen size and a high 
decoding capability, the media receiving apparatus 202 sets the 
5 threshold for media importance to be "medium" and performs 
monitoring of the five points except the point F having a "low 
degree" of media importance by displaying high-quality video 
images because the network 203 has a high communication 
capability, while the media receiving apparatus performs 

10 monitoring of the point E having a "high degree" of media 
importance by displaying a high-quality high-resolution (1Mbps 
and VGA resolution) video image of the point E, so that the media 
receiving apparatus forms a presentation layout in which a 
thumbnail display (64k bps and SQVGA resolution) of the points 

15 A to D is produced, where VGA resolution is 640x480, and SQVGA 
resolution is 160x120. The media receiving apparatus thus 
carries out simultaneous presentation of the five video images. 

In such a case, the media delivering apparatus 301 creates 
metadata 306 used for delivery of surveillance video images, 

20 which will generate a presentation layout as shown in Fig. 10, 
and delivers the metadata to the media receiving apparatus 202. 

Fig. 11 is a diagram showing an example of the metadata 
306 which is used for delivering the media data 205 about video 
images to be displayed on the entire screen 601 of Fig. 10 . Each 

25 tag included in the metadata 1201 of Fig. 11 has the same meaning 
as a corresponding tag included in the metadata 1301 which is 
already explained with reference to Fig. 7. 

The media receiving apparatus 202 receives the metadata 
1201 as shown in Fig. 11, and issues a request 208 for delivery 

30 of the media data 205 based on the address information described 



23 

in the metadata. The media delivering apparatus 301 receives 
the request 208, and performs a delivery 210 of the media data 
205 to the media receiving apparatus 202. The media receiving 
apparatus 202 receives the media data 205 delivered thereto, 

5 and provides a synchronous presentation of the received media 
data in the form of the presentation layout shown in Fig. 10 
based on the presentation layout information described in the 
metadata 1201 as shown in Fig. 11. 

On the other hand, also in the case of monitoring six 

10 points based on the same list of degrees of importance 607 of 
Fig. 10, the media delivering apparatus 301 creates metadata 
306 which differs from the metadata 1201 of Fig. 11 if the 
communication capability 312 of the network 203 is low, for 
example, if the transmission rate is 32 bps. When the 

15 communication capability 312 of the network 203 is low, the 
media delivering apparatus 301 sets the threshold for media 
importance 204 to be "high" instead of "medium", and restricts 
the number of media data 205 which are to be delivered to the 
media receiving apparatus first. In the example of the 

20 importance list 607 of Fig. 10, when the threshold for media 
importance is set to be "high", the number of media data which 
are to be delivered to the media receiving apparatus is 
restricted to one, i.e., only the video data about the point 
E is delivered to the media receiving apparatus. After doing 

25 so, the media delivering apparatus 301 creates dynamically 
metadata 306 used for delivering the video data of 32k bps about 
the point E to the media receiving apparatus 202, and then 
performs a delivery 207 of the video data to the media receiving 
apparatus . 

30 Next, a case in which all the parameters shown in 
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above-mentioned (a) to (d) are used, and the communication 
capability 312 of the network 203 and the receiving capability 
313 of the media receiving apparatus 202 are all low in the 
example of Fig. 8 will be explained. In this example, it is 

5 assumed that the network 203 has a low communication capability 
312 which is of order of 32kbps, and a resolution SQVGA (160x120), 
and the media receiving apparatus 202 has, as the receiving 
capability 313, a low decoding capability of decoding only one 
video data at a time. In such a case, the media delivering 

10 apparatus 301creates metadata 1401 as shown in Fig. 13, and 
performs a delivery 207 of the metadata to the media receiving 
apparatus 202 in order to carry out delivery of video images 
as shown in Fig. 12. 

Fig. 12 is a diagram showing an example of the presentation 

15 layout in the case of monitoring four points (e.g., points. A 
to D) . In this example, a video image 801 of point A, a video 
image 802 of point B, a video image 803 of point C, a video image 
804 of point D, and a video image 805 of point A are transmitted 
one by one to the media receiving apparatus and are presented 

20 one by one by the media receiving apparatus based on a list of 
degrees of importance 806. 

In the example shown in Fig. 12, since there is no 
difference among the degrees of media importance 204 assigned 
to the plurality of surveillance points, as shown in the 

25 importance list 806, and the media receiving apparatus 202 has 
a small screen size and a low decoding capability, the media 
receiving apparatus 202 presents only a video image of one point 
at a time. In addition, since the communication capability 312 
of the network 203 is low, the media delivering apparatus 

30 monitors each surveillance point to create a video image of the 
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point at a lower bit rate. Therefore, since there is no 
difference among the degrees of media importance 204 
respectively assigned to the four surveillance points, the 
media delivering apparatus sequentially switches among the 
5 video images of the four points in such a manner as 
A->B->C->D->A-> ... at predetermined intervals. The media 
delivering apparatus 301 thus creates metadata 306 for 
implementing such a presentation layout based on the low 
resolution of the media receiving apparatus 202, and performs 
10 a delivery 207 of the metadata to the media receiving apparatus 
202. 

Fig. 13 is a diagram showing an example of the metadata 
306, and shows an example for implementing a presentation layout 
as shown in Fig. 12. In the metadata 1401, an seq tag 1402 shows 

15 that media data 205 specified by a text closed by </seq> are 
presented in order, and repeat = "indefinite" shows that the 
presentation of the media data is repeated an infinite number 
of times. A video tag 1403 is a tag which defines media data 
205, region specifies id of a region tag set by the presentation 

20 layout information and shows that the media data is to be 
presented in the region, src shows the address information 
indicating the location of the media data 205, and dur shows 
a presentation time. 

In Fig. 8, the media receiving apparatus 202 receives the 

25 metadata 1401 as shown in Fig. 13, and issues a request 208 for 
delivery of the media data 205 based on the address information 
described in the metadata. The media delivering apparatus 301 
receives the request 208, and performs a delivery 210 of the 
media data 205 to the media receiving apparatus 202. The media 

30 receiving apparatus 202 receives the media data 205 delivered 
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thereto, and presents the media data in the form of the 
presentation layout of Fig. 12 which is described in the 
metadata 1401. The timing at which the media receiving 
apparatus switches among the corresponding video images is 

5 described in dur of the video tag 1403 included in the metadata 
1401 of Fig. 13. The example of the metadata 1401 of Fig. 13 
is equivalent to a case where n is set to 30 seconds in Fig. 
12. The media receiving apparatus 202 issues a request 208 for 
delivery of the media data 205 at the timing at which it switches 

10 among the video images based on the address information 
described in the metadata 1401, and switches among the presented 
images one by one. 

In the example of Fig. 8, when all the parameters shown 
in above-mentioned (a) to (d) are used, the communication 

15 capability 312 of the network 203 and the receiving capability 
313 of the media receiving apparatus 202 are all low, and there 
is a difference among the degrees of importance respectively 
assigned to the surveillance points, the media delivering 
apparatus 301 creates metadata 1601 as shown in Fig. 15 in order 

20 to carry out a delivery of video images as shown in Fig. 14, 
for example, and performs a delivery 207 of the metadata to the 
media receiving apparatus 202. 

Fig. 14 is a diagram showing an example of the presentation 
layout in the case of monitoring three points (e.g., points A 

25 to C) . In this example, a video image 901 of point B, a video 
image 902 of point C, a video image 903 of point A, and a video 
image 904 of point B are transmitted one by one to the media 
receiving apparatus and are presented one by one by the media 
receiving apparatus based on a list of degrees of importance 

30 905. 
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In the example shown in Fig. 14, since the degree of media 
importance 204 of the video image of the point B is "high", and 
the degree of media importance 204 of the video image of each 
of the points A and B is "medium", as shown in the importance 
5 list 905, and the media receiving apparatus 202 has a small 
screen size and a low decoding capability, the media receiving 
apparatus presents only a video image of one point at a time. 
In addition, since the communication capability 312 of the 
network 203 is low, the media delivering apparatus monitors each 

10 surveillance point to create a video image of the point at a 
lower bit rate. Therefore, since the above-mentioned example 
is the case where the three points including the point B having 
a high degree of media importance 204 are monitored, the media 
delivering apparatus sets the presentation time of the video 

15 image of the point B to be longer than that of the video images 
of the points A and C, and sequentially switches among the video 
images of the three points in such a manner as B->C->A->B-> .... 

Fig. 15 is a diagram showing an example of the metadata 
306, and shows an example for implementing the presentation 

20 layout of Fig. 14. Each tag included in this metadata 1601 is 
the same as a corresponding tag of each above-mentioned 
metadata . 

In Fig. 8, the media receiving apparatus 202 receives the 
metadata 1601 as shown in Fig. 15, and issues a request 208 for 

25 delivery of the media data 205 based on the address information 
described in the metadata. The media delivering apparatus 301 
receives the request 208, and carries out a delivery 210 of the 
media data 205 to the media receiving apparatus 202. The media 
receiving apparatus 202 receives the media data 205 delivered 

30 thereto, and presents the media data in the form of the 
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presentation layout shown in Fig. 14 which is described in the 
metadata 1601. The timing at which the media receiving 
apparatus switches among the corresponding video images is 
described in the metadata 1601 of Fig. 15. This example of the 
5 metadata 1601 is equivalent to a case where m is set to 60 seconds 
and n is set to 30 seconds in Fig. 14. The media receiving 
apparatus 202 issues a request 208 for delivery of the media 
data 205 at the timing at which it switches among the video images 
based on the address information described in the metadata 1601, 

10 and switches among the presented images one by one. 

The metadata 306 which the media delivering apparatus 301 
of Fig. 8 creates can be described in a form including a 
conditional branch for determining the presentation layout. 
This conditional branch can describe conditions concerning not 

15 only the parameters shown in the above-mentioned (b) to (d) , 
but also other parameters. The media receiving apparatus 202 
analyzes the metadata 306 in which a conditional branch is 
described and which is delivered from the media delivering 
apparatus 301 so as to judge conditions in the conditional 

20 branch, selects media data 205 to be received and a presentation 
layout based on results of the judgment, and issues a request 
208 for delivery of the selected media data 205 based on address 
information about the location of the media data 205 to be 
received. The media delivering apparatus 301 then performs a 

25 delivery 210 of the media data 205 to the media receiving 
apparatus 202 in response to the request 208. The media 
receiving apparatus 202 receives the media data 205 delivered 
thereto, and presents the media data in the form of the 
presentation layout which is selected thereby based on the 

30 above-mentioned j udgment . 



29 

Fig. 16 is a diagram showing an example of the metadata 
306 in which a conditional branch for determining the 
presentation layout is included. In the metadata 1501 of Fig. 
16, a switch tag 1502 is a tag for causing the media receiving 
5 apparatus to sequentially examine conditions placed in tags 
included in a text closed by </switch> in the order that they 
appear in the text and to leave a corresponding condition intact, 
a layout tag 1503 shows that a text closed by </layout> describes 
a presentation layout, systemBitrate shows requirements on the 

10 communication capability 312 of the network 203, and a par tag 
1504 shows that the media data 205 specified by a text closed 
by </par> are to be simultaneously presented. 

This metadata 1501 is an example of the metadata 306 which 
enables the media receiving apparatus 202 to select a 

15 presentation layout as shown in Fig. 10 or 12 under the 
conditions specified by the parameters shown in the 
above-mentioned (b) . To be more specific, under the conditions 
specified by the parameters shown in the above-mentioned (b) , 
the media receiving apparatus 202 is allowed to selectively 

20 issue either a request 208 for delivery of the media data 205 
in such a manner that a presentation layout as shown in Fig. 
10 is produced when the media receiving apparatus determines 
that the network 203 has a high communication capability 312, 
or a request 208 for delivery of the media data 205 in such a 

25 manner that a presentation layout as shown in Fig. 12 is produced 
when the media receiving apparatus determines that the network 
203 has a low communication capability 312. 

Thus, the media delivering apparatus 301 can deliver the 
media data 205 to the media receiving apparatus 202 in 

30 consideration of the degree of media importance 204 assigned 
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to the media data 205, the communication capability 312 of the 
network 203, and the receiving capability 313 of the media 
receiving apparatus 202, by creating metadata 306 based on the 
parameters shown in above-mentioned (b) to (d) . 
5 Next, processing in a case where the degree of media 

importance 204 is changed while a presentation of media data 
205 is provided will be explained. When the degree of media 
importance 204 is changed, there is a necessity to carry out 
a process of delivering media data 205 whose degree of media 
10 importance 204 has been changed to a higher degree on a priority 
basis . 

Fig. 17 is a diagram for explaining another variant of 
the media delivery system which uses metadata in accordance with 
embodiment 1 of the present invention. In Fig. 17, the media 

15 delivering apparatus 401 is connected to the media receiving 
apparatus 402 by way of the network 203, and can access the degree 
of media importance 204 assigned to any media data 205 and any 
media data 205. 

Next, the operation of the media delivery system shown 

20 in Fig. 17 will be explained. 

Fig. 17 shows a case where a change indication 414 is 
issued for the degree of media importance 204 assigned to media 
data 205 at a certain time while the media delivery system 
performs a delivery of media data after carrying out a procedure 

25 as shown in Fig. 8. The media delivering apparatus 401 of Fig. 
17 differs from the media delivering apparatus 301 of Fig. 8 
in that it additionally has a function of changing the degree 
of media importance 204 assigned to each media data. To be more 
specific, when changing the degree of media importance 204 

30 assigned to media data, the media delivering apparatus 401 uses 
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a parameter shown in the following (e) : 

(e) A changed degree of media importance 204 

For example, the media delivering apparatus uses one or 
more of the parameters shown in the above-mentioned (b) to (d) , 
5 as well as the parameter shown in the above-mentioned (e) , 
selects one or more media data 205 in consideration of not only 
the changed degree of media importance 204 but also the 
communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 202, 
10 updates the metadata 306 in which the address information about 
the selected media data 205 and presentation layout information 
are described, creates metadata 406, and performs a delivery 
207 of the updated metadata 406 to the media receiving apparatus 
402. 

15 As compared with the media receiving apparatus 202 of Fig. 

8, the media receiving apparatus 402 additionally has a function 
of analyzing the change command. The details of the change 
command will be mentioned later. The media receiving apparatus 
402 receives the above-mentioned metadata 406, and suspends 

20 presentation of the media data 205 currently being presented, 
and issues a request 208 for delivery of media data 205 based 
on the address information described in the metadata 406 which 
it has newly received. The media delivering apparatus 401 
receives the request 208, and sends a request response 209 to 

25 the media receiving apparatus 402 and then performs a delivery 
210 of the media data 205 to the media receiving apparatus. The 
media receiving apparatus 402 receives the media data 205 
delivered thereto, and provides a synchronous presentation of 
the media data in the form of a presentation layout described 

30 in the metadata 406 which it has newly received. 
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The media delivering apparatus 401 can calculate, as the 
change command 406, a difference between the metadata 306 which 
is generated before the degree of media importance 204 is 
changed and the metadata which is generated after the degree 

5 of media importance 204 is changed, instead of the 
above-mentioned metadata 406, and can perform a delivery 207 
of the change command to the media receiving apparatus . In this 
case, the media receiving apparatus 402 interprets the change 
command 406 to update the metadata 306 about the media data 

10 currently being presented. As a result, the same advantage is 
provided, as in the case where the whole metadata 406 which is 
changed as mentioned above is sent to the media receiving 
apparatus. In general, since the change command 406 has a 
smaller amount of data than the whole metadata 406, and provides 

15 the same advantage, it comes in useful. 

For example, reception of alarm information notified from 
a point under surveillance to the media delivering apparatus 
401 is cited as a factor that causes the change indication 414 
for changing the degree of media importance 204 assigned to 

20 media data. However, this invention is not limited to this 
specific example from a workable standpoint. 

Fig. 18 is a block diagram showing the internal structure 
of the media delivering apparatus 401 of Fig. 17. This media 
delivering apparatus 401 differs from the media delivering 

25 apparatus 301 of Fig. 9 in that it additionally has an importance 
change monitoring unit 3106, and includes a media selecting unit 
3107 instead of the media selecting unit 3006 and a 
transmission-data generating unit 3109 instead of the 
transmission-data generating unit 3008. 

30 Next, the operation of the media delivering apparatus 
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shown in Fig. 18 will be explained. 

The importance change monitoring unit 3106 always 
monitors whether or not a change indication 414 for changing 
the degree of media importance 204 assigned to media data is 
issued, and, when a change command for changing the degree of 
media importance 204 assigned to media data is issued, changes 
the degree of media importance 204, and notifies the media 
selecting unit 3107 that the degree of media importance 204 has 
been changed. The media selecting unit 3107 receives the 
notification, selects media data 205 based on both the changed 
degree of media importance 204 and at least one of the 
communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 402, 
and notifies the selected media data to the transmission-data 
generating unit 3109. 

As compared with the transmission-data generating unit 
3008, the transmission-data generating unit 3109 further has 
a function of creating a change command 406 indicating the 
difference between the yet-to-be-changed metadata 306 and the 
changed metadata. 

The transmission-data generating unit 3109 creates 
changed metadata 406 in which both address information 
indicating the location of each media data 205 selected by the 
media selecting unit 3107, and presentation layout information 
indicating a layout of the selected media data on the screen 
of the media receiving apparatus 402 which is determined based 
on both the changed degree of media importance 204 and at least 
one of the communication capability 312 of the network 203 and 
the receiving capability 313 of the media receiving apparatus 
402 are described, or creates a change command 406 indicating 
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a difference between the yet-to-be-changed metadata 306 and the 
changed metadata based on both the changed degree of media 
importance 204 and at least one of the communication capability 
312 of the network 203 and the receiving capability 313 of the 
5 media receiving apparatus 402, and then delivers either the 
changed metadata or the change command to the data transmitting 
unit 2708. The details of the change command 406 will be 
explained later. Other operations of the media delivering 
apparatus are the same as those of the media delivering 

10 apparatus 301 of Fig. 9. 

Fig. 19 is a block diagram showing the internal structure 
of the media receiving apparatus 402 of Fig. 17. This media 
receiving apparatus 402 of Fig. 5 differs from the media 
receiving apparatus 202 of Fig. 5 in that it includes a data 

15 analyzing unit 3210 instead of the data analyzing unit 2910. 

Next, the operation of the media receiving apparatus 
shown in Fig. 19 will be explained. 

The data analyzing unit 3210 differs from the data 
analyzing unit 2910 of Fig. 5 in that it interprets not only 

20 the metadata 406 but also the change command 406, and 
additionally has a function of comparing the change command with 
the metadata 306 which the media receiving apparatus has already 
received to update the metadata 306. Other operations of the 
media receiving apparatus are the same as those of the media 

25 receiving apparatus 202 of Fig. 5. 

A surveillance system will be explained with reference 
to Fig. 17 as an example of the media delivery system. An 
explanation will be given for a case in which all the parameters 
shown in the above-mentioned (a) to (e) are used, and the 

30 communication capability 312 of the network 203 and the 
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receiving capability 313 of the media receiving apparatus 202 
are all high in the example of Fig. 17. 

Fig. 20 is a diagram showing an example of switching 
between presentation layouts in a case where 6 points (e.g., 
5 points A to F) are simultaneously monitored and the degree of 
media importance 204 assigned to media data is changed. In this 
example, a presentation layout of a video image 1003 of point 
A, a video image 1004 of point B, a video image 1005 of point 

C, a video image 1006 of point D, and a video image 1007 of point 
10 E which is produced on the whole screen 1001 based on a list 

of importance degrees 1013 is switched to a presentation layout 
of a video image 1008 of point A, a video image 1009 of point 
E, a video image 1010 of point C, a video image 1011 of point 

D, and a video image 1007 of point B which is produced on the 
15 whole screen 1002 based on a changed list of importance degrees 

1014. In the presentation layout on the entire screen 1001 
based on the yet-to-be-changed importance list 1013, the video 
image of point E is a surveillance video image having a "high 
degree" of media importance, and the video images of points A 

20 to D are surveillance video images having a "medium degree" of 
media importance. In the presentation layout on the entire 
screen 1002 based on the changed importance list 1014, the video 
image of point B is switched to a surveillance video image having 
the "highest degree" of media importance because an alarm has 

25 been issued from the point B. 

Fig. 21 is a diagram showing an example of changing the 
metadata 204 when the degree of media importance 204 assigned 
to media data is changed, like Fig. 20. In this example, a case 
where metadata 1701 based on a list of importance degrees 1703 

30 is changed to metadata 1702 based on a list of importance degrees 
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1704 because of increase in the importance assigned to the point 
B is shown. 

In a case of assignment of importance as shown in the 
importance list 1013 of Fig. 20, the media delivering apparatus 
5 401 performs a delivery 207 of metadata 1701 of Fig. 21 used 
for delivery of video images, which will generate a presentation 
layout of the video images on the entire screen 1001, to the 
media receiving apparatus 4 02. The media receiving apparatus 
402 receives the metadata 1701, and then issues a request 208 

10 for delivery of the media data 205 based on the address 
information described in the metadata. The media delivering 
apparatus 401 receives the request 208, and performs a delivery 
210 of the media data 205 to the media receiving apparatus 402. 
The media receiving apparatus 402 receives the media data 205 

15 delivered thereto, and provides a synchronous presentation of 
the media data so that they have a presentation layout as shown 
in Fig. 20 on the entire screen 1001 based on the presentation 
layout information described in the metadata 1701. 

After that, when the change indication 414 for changing 

20 the degree of media importance 204 assigned to media, data is 
issued, the media delivering apparatus 401 changes the 
importance list to the importance list 1014 of Fig. 20, and then 
performs a delivery 207 of the metadata 1702 for updating the 
presentation layout to the media receiving apparatus 402 in 

25 order to generate a presentation layout of the media data as 
shown in Fig. 20 on the entire screen 1002. When receiving the 
metadata 1702, the media receiving apparatus 402 suspends the 
presentation of the media data 205 currently being presented, 
and then issues a request 208 for delivery of the media data 

30 205 based on the address information described in the metadata 
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1702 which the media receiving apparatus has newly received. 
The media delivering apparatus 401 receives the request 208, 
and performs a delivery 210 of the media data 205 to the media 
receiving apparatus 402. The media receiving apparatus 402 

5 receives the media data 205 delivered thereto, and provides a 
synchronous presentation of the media data so that they have 
the presentation layout as shown in Fig. 20 on the entire screen 
1002 thereof based on the presentation layout information 
described in the metadata 1702. 

10 Next, an explanation will be given for a case in which 

all the parameters shown in above-mentioned (a) to (e) are used, 
and the communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 202 
are all low in the example of Fig. 17. 

15 Fig. 22 is a diagram showing an example of switching 

between presentation layouts when the degree of media 
importance 204 assigned to media data is changed in a case of 
simultaneously monitoring three points (e.g., points A to C) . 
In this case, when an alarm is issued and then the degree of 

20 media importance 204 assigned to the point B increases while 
a video image 1101 of point A, a video image 1102 of point B, 
a video image 1103 of point C, and a video image 1104 of point 
A are transmitted one by one to the media receiving apparatus 
and are presented one by one by the media receiving apparatus 

25 based on a list of degrees of importance 1110 in which no media 
importance 204 has been changed yet, only a video image 1105 
of point B will be transmitted to the media receiving apparatus 
and will be presented by the media receiving apparatus based 
on the changed list of degrees of importance 1111. After that, 

30 when the alarm is released, and the degree of media importance 
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204 assigned to the point B decreases slightly, a video image 
1106 of point A, a video image 1107 of point B, a video image 
1108 of point C, and a video image 1109 of point A are transmitted 
one by one to the media receiving apparatus and are presented 
5 one by one by the media receiving apparatus based on the 
f urther-changed list of degrees of importance 1112. 

Fig. 23 is a diagram showing an example of change in the 
metadata in a case where the degree of media importance 204 
assigned to media data is changed, like Fig. 22. In this example, 

10 a case where metadata 1801 based on a list of importance degrees 
1804 is changed to metadata 1802 based on a list of importance 
degrees 1805 because of increase in the importance assigned to 
the point B, and is further changed to metadata 1803 based on 
a list of importance degrees 1806 because of slightly decrease 

15 in the importance assigned to the point B is shown. 

This example is the example in which video image of points 
whose importance is greater than "medium degree of importance" 
are presented one by one, and, when a video image of a point 
whose importance is "the highest degree of importance" exists, 

20 only the vide image is presented until the importance assigned 
to the point is changed from "the highest degree of importance" 
to a lower degree of importance. 

In a case of assignment of importance as shown in the 
importance list 1110 of Fig. 22, the media delivering apparatus 

25 401 delivers the metadata 1801 of Fig. 23 used for delivery of 
media data, which causes sequential presentation of the media 
data, such as presentation of the video image 1101 of point A, 
video image 1102 of point B, video image 1103 of point C, and 
video image 1104 of point A, to the media receiving apparatus 

30 402. The media receiving apparatus 402 receives the metadata 
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1801, and issues a request 208 for delivery of the media data 
205 based on the address information described in the metadata. 
The media delivering apparatus 401 receives the request 208 , 
and performs a delivery 210 of the media data 205 to the media 
5 receiving apparatus 402. The media receiving apparatus 402 
receives the media data 205 delivered thereto, and presents the 
video image 1101 of point A, video image 1102 of point B, video 
image 1103 of point C, and video image 1104 of point A in the 
order that they are delivered thereto based on the presentation 

10 layout information described in the metadata 1801. The timing 
at which the media receiving apparatus switches among the video 
images is described in the metadata 1801, and the media 
receiving apparatus 402 issues a request 208 for delivery of 
the media data 205 at the timing at which it switches among the 

15 video images based on the address information described in the 
metadata, and switches among the presented images one by one. 

After that, when an alarm is issued from the point B and 
the importance list is changed to an importance list 1111 of 
Fig. 22 as a result, the media delivering apparatus 401 delivers 

20 the metadata 1802 of Fig. 23 which is used for changing the 
presentation layout to the media receiving apparatus 402 in 
order to cause presentation of only the video image of point 
B. When receiving the metadata 1802, the media receiving 
apparatus 402 suspends the presentation of the media data 205 

25 currently being presented, and then issues a request 208 for 
delivery of the media data 205 based on the address information 
described in the metadata 1802 which the media receiving 
apparatus has newly received. The media delivering apparatus 
401 receives the request 208, and performs a delivery 210 of 

30 the media data 205 to the media receiving apparatus 402. The 
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media receiving apparatus 402 receives the media data 205 
delivered thereto, and presents only the video image 1105 of 
point B based on the presentation layout information described 
in the metadata 1802 which the media receiving apparatus has 
5 newly received. 

Furthermore, after that, when an alarm issued from the 
point B is released, and the importance list is changed to an 
importance list 1112 of Fig. 22, the media delivering apparatus 
401 delivers the metadata 1803 of Fig. 23 which is used for 

10 carrying out a delivery of video images and which causes 
sequential presentation of the media data, such as presentation 
of the video image 1106 of point A, video image 1107 of point 
B, video image 1108 of point C, and video image 1109 of point 
A shown in Fig. 22, to the media receiving apparatus 402. When 

15 receiving the metadata 1803 shown in Fig. 23, the media 
receiving apparatus 402 suspends the presentation of the media 
data 205 currently being presented, and then issues a request 
2.08 for delivery of the media data 205 based on the address 
information described in the metadata 1803 which the media 

20 receiving apparatus has newly received. The media delivering 
apparatus 401 receives the request 208, and performs a delivery 
210 of the media data 205 to the media receiving apparatus 402. 
The media receiving apparatus 402 receives the media data 205 
delivered thereto, and sequentially presents the video image 

25 1106 of point A, video image 1107 of point B, video image 1108 
of point C, and video image 1109 of point A based on the 
presentation layout information described in the metadata 1803 
which the media receiving apparatus has newly received. The 
timing at which the media receiving apparatus switches among 

30 the video images is described in the metadata 1803 shown in Fig. 



41 

23, and the media receiving apparatus 402 issues a request 208 
for delivery of the media data 205 at the timing at which it 
switches among the video images based on the address information 
described in the metadata, and switches among the presented 
5 images one by one. 

In Fig. 17 , when the degree of media importance 204 
assigned to media data is changed, the media delivering 
apparatus 401 can use a method of transmitting, as a change 
command 406, only a difference between the yet-to-be-updated 

10 metadata and the updated metadata instead of updating the 
metadata 306 and then delivering the whole of the updated 
metadata 406 to the media receiving apparatus 402. 

Fig. 24 is a diagram showing an example of the definition 
of the format of the change command, and an example of the 

15 metadata. There are an add command (addition), a replace 
command (replacement) , and a delete command (deletion) as 
examples of the change command, and an operation about element 
and attribute can be performed on each command, as shown in a 
change command format list 2401. A time at which a command is 

20 executed can be omitted, and the command is executed immediately 
when the execution time is omitted. An absolute node shows a 
layer position of the metadata and each change command is 
associated with a layer of the metadata 2402 using an absolute 
node correspondence table 2403. In accordance with this 

25 embodiment 1, the change command is not limited to the examples 
shown in Fig. 24. 

An example of using the change command defined as shown 
in Fig. 24 will be explained. 

Fig. 25 is a diagram showing an example of the change 

30 command. When updating the metadata, the media delivering 
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apparatus 401 transmits a change command 1901 which corresponds 
to a difference between the metadata 1701 as shown in Fig. 21 
and the metadata 1702 as shown in Fig. 25 to the media receiving 
apparatus, instead of sending the whole of the changed metadata 
5 1702 of Fig. 21. The media receiving apparatus 402 receives 
the change command 1901 of Fig. 25, refers to the metadata 1701 
of Fig. 21 which it has already received, and changes the 
metadata 1701 based on the change command 1901. When the 
metadata 1701 of Fig. 21 is updated using the change command 

10 1901 of Fig. 25, the metadata 1701 changes to the metadata 1702 
of Fig. 21. When receiving the change command 1901, the media 
receiving apparatus 402 suspends the presentation of the media 
data currently being received, updates the metadata 1701 based 
on the change command 1901, and then issues a request 208 for 

15 delivery of the media data 205 based on the address information 
described in the metadata 1702 to which the metadata 1701 has 
been changed. The media delivering apparatus 401 receives the 
request 208, and performs a delivery 210 of the media data 205 
to the media receiving apparatus 402. The media receiving 

20 apparatus 402 receives the media data 205 delivered thereto, 
and provides a synchronous presentation of the media data based 
on the presentation layout information described in the 
metadata 1702 of Fig. 21 which has been newly created based on 
the change command 1901. 

25 Fig. 26 is also a diagram showing an example of the change 

command. Also when updating the metadata 1802 and 1803, the 
media delivering apparatus 401 transmits both a change command 
2001 of Fig. 26 which corresponds to a difference between the 
metadata 1801 and 1802 as shown in Fig. 23 and a change command 

30 2002 of Fig. 26 which corresponds to a difference between the 
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metadata 1802 and 1803 as shown in Fig. 23 to the media receiving 
apparatus, instead of sending the whole of the changed metadata 
1802 and 1803. The media receiving apparatus 402 receives the 
change command 2001 of Fig. 26, refers to the metadata 1801 of 
5 Fig. 23 which it has already received, and changes the metadata 
1801 based on the change command 2001. When the metadata 1801 
of Fig. 23 is updated using the change command 2001 of Fig. 20, 
the metadata 1801 changes to the metadata 1802 of Fig. 23. 
Similarly, when the metadata 1802 of Fig. 23 is updated using 

10 the change command 2002 of Fig. 26, the metadata 1802 changes 
to the metadata 1803 of Fig. 23. Every time when receiving the 
change commands 2001 and 2002, the media receiving apparatus 
402 suspends the presentation of the media data currently being 
received, and issues a request 408 for delivery of the media 

15 data 205 based on the address information described in the 
changed metadata 1802 and 1803 . The media delivering apparatus 
401 receives the request 208, and performs a delivery 210 of 
the media data 205 to the media receiving apparatus 402. The 
media receiving apparatus 402 receives the media data 205 

20 delivered thereto, and provides a synchronous presentation of 
the media data based on the presentation layout information 
described in the metadata 1802 and 1803 which have been newly 
created based on the change commands 2001 and 2002. 

Thus, the present embodiment offers an advantage of, when 

25 the degree of media importance 204 assigned to media data is 
changed, being able to reduce the amount of metadata to be 
delivered to the media receiving apparatus by sending a change 
command for changing the metadata which has been already 
delivered to the media receiving apparatus. 

30 As mentioned above, in accordance with this embodiment 
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1, the media delivering apparatus 301 creates metadata 306 in 
which both address information media data 205 and presentation 
layout information about a layout of the media data 205 are 
described by taking into consideration the communication 
5 capability 312 of the network 203 and the receiving capability 
313 of the media receiving apparatus 202, as well as the degree 
of media importance 204 assigned to each media data, and 
delivers the metadata to the media receiving apparatus 202, and 
the media receiving apparatus 202 presents the media data 205 

10 based on the metadata 306 delivered thereto. Therefore, the 
present embodiment offers an advantage of being able to present 
media data 205 normally, and to deliver the media data 205 which 
are adapted to each media receiving apparatus 202. 

Furthermore, in accordance with this embodiment 1, the 

15 media delivering apparatus 401 creates metadata 406 in which 
both address information about media data 205 and presentation 
layout information about a layout of the media data 205 are 
described by taking into consideration the communication 
capability 312 of the network 203 and the receiving capability 

20 313 of the media receiving apparatus 402, as well as the changed 
degree of media importance 204, and delivers the metadata to 
the media receiving apparatus 402, and the media receiving 
apparatus 402 presents the media data 205 based on the metadata 
406 delivered thereto. Therefore, the present embodiment 

25 offers another advantage of being able to present media data 
205 normally even when the degree of media importance 204 
assigned to each media data is changed, and to deliver the media 
data 205 which are adapted to each media receiving apparatus 
402. 

30 In addition, in accordance with this embodiment 1, when 
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the degree of media importance 204 assigned to media data is 
changed, the media delivering apparatus 401 delivers a change 
command 406 for changing the delivered metadata which has been 
already delivered to the media receiving apparatus. Therefore, 
5 the present embodiment offers a further advantage of being able 
to reduce the amount of metadata. 

Embodiment 2 . 

Also in this embodiment 2, although an explanation will 

10 be made assuming that media data 205 is surveillance video image 
data encoded with MPEG-4 Visual, metadata is written in SMIL, 
RTP is used as a media delivery protocol and RTSP is used as 
a media delivery control protocol, this embodiment 2 is not 
limited to this specific example from a workable standpoint. 

15 In accordance with above-mentioned embodiment 1, only the 

degree of media importance 204 assigned to each media data 205 
at the current time is known, while a degree of media importance 
which will be assigned to any media data in future is unknown. 
That is, when the degree of media importance 204 assigned to 

20 media data is changed, the media delivering apparatus 401 needs 
to notify the media receiving apparatus 402 that the degree of 
media importance 204 assigned to the media data is changed using 
the metadata 406 or a change command 406. 

In this embodiment 2, a case where a time-varying degree 

25 of media importance is assigned to each media data 205 in advance 
will be explained. That is, in accordance with this embodiment 
2, a plurality of media data 205, as well as the time-varying 
degree of media importance assigned to each media data 205, are 
stored in the media delivering apparatus in advance, and the 

30 media delivering apparatus searches for media data 205 to be 
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delivered and creates metadata by setting the time-varying 
degree of media importance and so on as parameters. 

Fig. 27 is a diagram explaining a media delivery system 
which uses metadata in accordance with embodiment 2 of the 
5 present invention. In Fig. 27, a media delivering apparatus 
501 is connected to a media receiving apparatus 402 by way of 
a network 203, and can access the time-varying degree of media 
importance 504 assigned to any media data 205 and any media data 
205. 

10 The operation of the media delivery system shown in Fig. 

27 will be explained. 

There is one or more media data 205 in the media delivery 
system, and a time-varying degree of media importance 504 is 
assigned to each media data 205. Each media data 205 has an 

15 address which is uniquely discriminable within the system. In 
this embodiment 2, the time-varying degree of media importance 
504 assigned to each media data has a time-varying broadening, 
that is, the time-varying degree of media importance is stored 
in a form of having time-varying variations. The media 

20 delivering apparatus 501 differs from the media delivering 
apparatus 401 of Fig. 17 in that it additionally has a function 
of creating metadata 506 having the time-varying degree of media 
importance 504 as a parameter. To be more specific, when 
receiving a request for delivery of media data from the media 

25 receiving apparatus 402, the media delivering apparatus 501 
uses one or more of parameters shown in the following (b) to 
(d) in addition to a parameter shown in the following (f ) : 

(b) The communication capability 312 of the network 203 

(c) The screen size as the receiving capability 313 of the media 
30 receiving apparatus 402 



47 

(d) The decoding capability as the receiving capability 313 of 
the media receiving apparatus 402 

(f) The time-varying degree of media importance 504 

Thus, by using one or more of the parameters shown in (b) 
5 to (d) in addition to the parameter shown in (f ) , and by taking 
into consideration not only the time-varying degree of media 
importance 204 but the communication capability 312 of the 
network 203 and the receiving capability 313 of the media 
receiving apparatus 402, the media delivering apparatus selects 

10 one or more media data 205 to be delivered, creates metadata 
506 in which address information about the location of each 
selected media data 205 and time-varying presentation layout 
information are described by taking into consideration 
variations in the presentation layout which are delivered to 

15 the media receiving apparatus in advance according to the 
time-varying degree of media importance 504, and carries out 
a delivery 207 of the metadata 506 to the media receiving 
apparatus 402. The media receiving apparatus 402 receives the 
metadata 506, and issues a request 208 for delivery of the media 

20 data 205 based on the address information described in the 
metadata. The media delivering apparatus 501 receives the 
request 208, sends a request response 209 to the media receiving 
apparatus 402, and carries out a delivery 210 of the media data 
205 to the media receiving apparatus. The media receiving 

25 apparatus 402 receives the media data 205 delivered thereto, 
and provides a presentation of the media data 205 based on the 
time-varying presentation layout information described in the 
metadata 506. The timing at which the media receiving apparatus 
switches among the video images is described in the metadata 

30 506, and the media receiving apparatus 402 issues a request 208 
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for delivery of the media data 205 at the timing at which it 
switches among the video images based on the address information 
described in the metadata, and switches among the presented 
images one by one. 
5 The media delivering apparatus 501 can create "minimum 

amount of initial metadata required for starting presentation 
of media data, and change command associated with a 
subsequently-changed portion" 506, instead of the metadata 506, 
and can perform a delivery 507 of the initial metadata and change 

10 command to the media receiving apparatus. In this case, the 
change command includes an execution time at which it is 
executed. In the case of such the "initial metadata and change 
command" 506, the media receiving apparatus 402 interprets the 
change command at the execution time when the change command 

15 is executed so as to update the metadata which is being executed. 
This variant offers the same advantage by performing such 
processing, like the above-mentioned example in which the media 
receiving apparatus receives the whole metadata 506 in which 
all time-varying variations are included. In general, since 

20 "initial metadata and change command" 506 has a smaller amount 
of data than the whole metadata 506, and provides the same 
advantage, it comes in useful. 

Fig. 28 is a block diagram showing the internal structure 
of the media delivering apparatus 501. This media delivering 

25 apparatus 501 differs from the media delivering apparatus 401 
of Fig. 18 in that the time-varying degree of media importance 
504 is provided instead of the degree of media importance 204 
and therefore the importance change monitoring unit 3106 is 
omitted, a media selecting unit 3306 is provided instead of the 

30 media selecting unit 3107, and a transmission-data generating 



49 

unit 3307 is provided instead of the transmission-data 
generating unit 3109. 

The operation of the media delivering apparatus shown in 
Fig. 28 will be explained. 
5 The media selecting unit 3306 differs from the media 

selecting unit 3107 in that it additionally has a function of 
performing a selection of media data by setting the time-varying 
degree of media importance 504 as parameters. 

That is, the media selecting unit 3306 selects media data 

10 205 to be delivered in consideration of the time-varying degree 
of media importance assigned to each media data in advance by 
selecting media data 205 to be delivered based on the 
time-varying degree of media importance 504 assigned to each 
media data and at least one of the communication capability 312 

15 of the network 203, and the receiving capability 313 of the 
media receiving apparatus 402, and notifies the selected media 
data to the transmission-data generating unit 3307. 

The transmission-data generating unit 3307 creates 
metadata 506 in which both address information indicating the 

20 location of each selected media data 205 and presentation layout 
information indicating a layout of the media data on the screen 
of the media receiving apparatus 402, which is determined based 
on both the time-varying degree of media importance 504 assigned 
to each selected media data 205 and at least one of the 

25 communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 402, 
are described, and outputs the metadata, or creates, as well 
as initial metadata at the time of start of the presentation 
in which both the address information indicating the location 

30 of each selected media data 205 and the presentation layout 
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information indicating a layout of the media data on the screen 
of the media receiving apparatus 402, which is determined based 
on both the time-varying degree of media importance 504 assigned 
to each selected media data 205 and at least one of the 
5 communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 402, 
are described, a change command for changing the initial 
metadata according to time-varying variations in the 
time-varying degree of media importance 504, and outputs the 

10 created "initial metadata and change command" 506. 

Thus, the transmission-data generating unit 3307 creates 
either metadata 506 about the media data 205 selected by the 
media selecting unit 3306, or "initial metadata and change 
command" 506, and then delivers it to the data transmitting unit 

15 2708. The explanation of the change command is already done 
with reference to Fig. 24. The other operations of the media 
delivering apparatus are the same as those of the media 
delivering apparatus 401 of Fig. 18. 

A surveillance system will be explained with reference 

20 to Fig. 27 as an example of the media delivery system. An 
explanation will be given for a case in which all the parameters 
shown in the above-mentioned (b) to (d) and shown in the 
above-mentioned (f ) are used, and the communication capability 
312 of the network 203 and the receiving capability 313 of the 

25 media receiving apparatus 402 are all high in the example of 
Fig. 27. 

Fig. 2 9 is a diagram showing an example of the metadata 
506 in a case the time-varying variations in the degree of media 
importance and the time when the degree of media importance is 
30 changed are known in advance. This diagram shows a case where 
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surveillance video images of points A to E at 14:00 to 14:30 
are monitored, and change in the presentation layout is set 
up while increase in the importance assigned to the point B is 
taken into consideration in advance, as shown in lists of 
5 degrees of importance 2104 and 2105. In this metadata 2101, 
a par tag 2102 shows that a simultaneous presentation of media 
data 205 specified by a text closed by </par> is provided, dur 
shows a presentation time, and begin of each video tag 2103 shows 
a start time of presentation. 

10 The media delivering apparatus 501 creates metadata which 

causes a transition from a presentation layout of images at 
14:00 tol4:10, such as images displayed on the entire screen 
1001 of Fig. 20, to a presentation layout of images at 14:10 
tol4:30, such as images displayed on the entire screen 1002 of 

15 Fig. 20, and performs a delivery 207 of the metadata 2101 as 
shown in Fig. 29 which causes the transition to the media 
receiving apparatus 402. The media receiving apparatus 402 
receives the metadata 2101 of Fig. 29, and issues a request 208 
for delivery of the media data 205 based on the address 

20 information described in the first half of the metadata 2101. 
The media delivering apparatus 501 receives the request 208, 
and then performs a delivery 210 of the media data 205 to the 
media receiving apparatus 402. The media receiving apparatus 
402 receives the media data 205 delivered thereto, and presents 

25 the media data so that they have a presentation layout on the 
entire screen 1001 of Fig. 20 based on the time-varying 
presentation layout information described in the metadata 2101 . 

After that, when the presentation time reaches 14:10, the 
media receiving apparatus 402 suspends the presentation of the 

30 media data 205 currently being presented, and performs a request 
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208 for delivery of the media data 205 based on the address 
information described in the latter half of the metadata 2101 
of Fig. 29. The media delivering apparatus 501 receives the 
request 208 , and performs a delivery 210 of the media data 205 
5 to the media receiving apparatus 402. The media receiving 
apparatus 402 receives the media data 205 delivered thereto, 
and presents the media data so that they have a presentation 
layout on the entire screen 1002 of Fig. 20 based on the 
time-varying presentation layout information described in the 

10 metadata 2101. 

Fig. 30 is a diagram showing an example of "initial 
metadata and change command" 506 in a case where the 
time-varying variations in the degree of media importance and 
the time when the degree of media importance is changed are known 

15 in advance . The same effect is produced even if a change command 
2202 containing initial metadata 2201 at the start time of 
presentation, which is least required at the start time of 
presentation, and the change time are transmitted 
simultaneously as shown in Fig. 30, instead of the metadata 2101 

20 of Fig. 29. Thus, the use of the change command 2202 makes it 
possible to produce the same effect with a smaller amount of 
data than in the case of transmission of all of the metadata 
2101. 

Although Fig. 29 shows an example of monitoring six points 
25 in a case where the time-varying variations in the degree of 
media importance and the time when the degree of media 
importance is changed are known in advance, as an example of 
surveillance application, there can be provided an application 
example of, when monitoring two or more sections, for example, 
30 two or more buildings, the above-mentioned six points belonging 
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to the same section, creating metadata 506 dynamically based 
on a degree of importance assigned to a specified time period 
within the section when triggered by a request for delivery of 
video images captured for the section during the specified time 
5 period, which is issued by the user of the media receiving 
apparatus 4 02, and delivering the metadata to the media 
receiving apparatus 402 by means of the media delivering 
apparatus 501. In the case of such a surveillance application 
example, the media receiving apparatus 402 can allow the user 

10 to select a section for which it makes a request for delivery 
of video images and to specify the date and time of starting 
presentation of the video images, and the media delivering 
apparatus 501 creates metadata 506 based on the importance at 
the start time of presentation which is effective only in the 

15 section selected by the user. 

It can be also considered that the media delivering 
apparatus 501 offers information which enables the user to 
specify the date and time of starting presentation of the video 
images . 

20 In this case, when the user of the media receiving 

apparatus 402 specifies a specific time period and a threshold 
(e.g., a medium or high degree of importance) for the media 
delivering apparatus 501, for example. The media delivering 
apparatus 501 refers to the time-varying degree of media 

25 importance 504, and notifies the time which falls within the 
time period specified by the user and when the time-varying 
degree of media importance 504 exceeds the threshold for media 
importance to the media receiving apparatus 402. In this 
application example, the user of the media receiving apparatus 

30 402 can specify the date and time of starting presentation of 
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the video images based on the information notified from the 
media delivering apparatus. 

Fig. 31 is a diagram showing an example of the presentation 
layout and an example of the metadata in a case of simultaneously 
5 monitoring four points (e.g., points A to D) . This example 
shows a case where a video image 2302 of point A, a video image 
2303 of point B, a video image 2304 of point C, and a video image 
2305 of point D are displayed on the entire screen 2301 based 
on a list of degrees of importance 2306, and a large number of 

10 video images at times which alarms were respectively issued at 
different points are checked at a time and at any time. The 
metadata 2307 is used to implement this example. 

In the example of Fig. 27, when all the parameters shown 
in the above-mentioned (b) to (d) and (f) are used, the 

15 communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 402 
are all high, and the degree of media importance assigned to 
media data captured at each of the plurality of points is known 
in advance as shown in Fig. 31. The media delivering apparatus 

20 501 creates either metadata 2307 or. "initial metadata and change 
command" for carrying out simultaneous presentation of video 
images respectively captured at the plurality of points and at 
times each having a high degree of importance so as to produce 
a presentation layout of the video images respectively captured 

25 at the plurality of points on the entire screen 2301 shown in 
Fig. 31. While the example of Fig. 29 is the one in which a 
simultaneous presentation of video images captured at an 
identical time is provided, and the example of Fig. 30 is the 
one in which an enlarged display of a video image of a point 

30 having a high degree of media importance is carried out, the 
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example of Fig. 31 is the one in which the highest degree of 
importance means generation of an alarm, and the user is allowed 
to check a large number of video images at times which alarms 
were respectively issued at different points at a time and at 
5 any time. 

Fig. 32 is a diagram for explaining another example of 
the media delivery system in accordance with embodiment 2 of 
the present invention. This media delivery system differs from 
the media delivery system of Fig. 27 in that a search unit 3414 

10 and attribution information 3417 are additionally provided, and 
a degree of media importance 3418 created by the search unit 
3414 is disposed instead of the time-varying degree of media 
importance 504. 

The operation of the media delivery system shown in Fig. 

15 32 will be explained. 

In this example, it is assumed that the attribution 
information 3417 includes alarm information associated with 
each media data 205. The media receiving apparatus 402 gets 
access 3415 to a search unit 3414 according to, for example, 

20 such a keyword as "occurrence of abnormal conditions in 
temperature sensor." The search unit 3414 refers to the 
attribution information 3417 associated with each media data 
205, and creates a degree of media importance 3418 according 
to each media data 205 associated with an alarm indicating 

25 "occurrence of abnormal conditions in temperature sensor." 
For example, the search device 3414 creates media importance 
data 3418 in which a "medium degree" of media importance is 
assigned to each media data 205 associated with an alarm 
indicating "occurrence of abnormal conditions in temperature 

30 sensor", while a "high degree" of media importance is assigned 
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to media data 205 associated with a most-recently-issued alarm 
indicating "occurrence of abnormal conditions in temperature 
sensor." The media delivering apparatus 501 sets, as 
parameters, the created media importance data 3418, the 

5 communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 402, 
dynamically determines a presentation layout to be delivered, 
creates either metadata 506 or "initial metadata and change 
command" 506, and performs a delivery 207 of the metadata or 

10 "initial metadata and change command" to the media receiving 
apparatus 402. To be more specific, the media delivering 
apparatus 501 selects media data 205 to be delivered based on 
the created media importance data 3418. Therefore, the media 
delivering apparatus selects, as the media data 205 to be 

15 delivered, a plurality of media data 205 each associated with 
an alarm indicating "occurrence of abnormal conditions in 
temperature sensor." The number of media to be delivered 
simultaneously and the quality of each media to be delivered 
are determined by the media delivering apparatus 501 based on 

20 the parameters, such as the communication capability 312 of the 
network 203 and the receiving capability 313 of the media 
receiving apparatus 402, as in the case of any above-mentioned 
embodiment. For example, when the communication capability 
312 and the receiving capability 313 are all high, the media 

25 delivering apparatus creates a presentation layout of the 
selected media data on an entire screen 601 as shown in Fig. 
10. In this presentation layout, media data 205 having a "high 
degree" of importance is displayed with a VGA resolution, and 
each remaining media data 205 having a "medium degree" of 

30 importance is displayed with an SQVGA resolution. In contrast, 
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when the communication capability 312 and the receiving 
capability 313 are all low, the media delivering apparatus sets 
the number of video images to be delivered at a time to 1, and 
creates a presentation layout of the selected media data such 

5 that media data 205 having a "high degree" of importance is 
started and, after that, remaining media data 205 each having 
a "medium degree" of importance are presented one by one. 

The media delivering apparatus 501 selects media data 205 
associated with an alarm indicating "occurrence of abnormal 

10 conditions in temperature sensor." Therefore, when 

simultaneously delivering a plurality of media data 
respectively associated with alarms indicating "occurrence of 
abnormal conditions in temperature sensor" which were issued 
at different times to the media receiving apparatus, the media 

15 delivering apparatus creates metadata 506 which causes a 
simultaneous delivery of the media data obtained at different 
times. Since the media delivering apparatus 501 creates such 
the metadata 506, the media receiving apparatus 402 can check 
the plurality of media data 205 which are search results which 

20 match with the search conditions simultaneously and efficiently. 
When receiving and presenting the plurality of media data 
obtained at different times simultaneously, the media receiving 
apparatus 402 presents the plurality of media data while 
achieving synchronization among the plurality of media data 205. 

25 Furthermore, even when the number of media data to be delivered 
simultaneously is one, since the plurality of media data are 
presented one by one in the order of importance, the user of 
the media receive terminal 402 can check the plurality of media 
data efficiently. 

30 As mentioned above, in accordance with this embodiment 
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2, the media delivering apparatus 501 takes into consideration 
the communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 402, 
as well as the time-varying degree of media importance 504 
assigned to each media data, creates metadata 506 in which 
address information about each selected media data 205 and 
time-varying presentation layout information are described, 
and delivers the metadata 506 to the media receiving apparatus 
402, and the media receiving apparatus 402 presents the media 
data 205 based on the metadata 506 delivered thereto. Therefore, 
the present embodiment offers an advantage of being able to 
present media data 205 normally and to deliver media data 205 
which are adapted to each media receiving apparatus 402. 

Furthermore, in accordance with this embodiment 2, the 
media delivering apparatus takes into consideration the 
communication capability 312 of the network 203 and the 
receiving capability 313 of the media receiving apparatus 402, 
as well as the time-varying degree of media importance 504 
assigned to each media data, creates "initial metadata and 
change command" 506 in which address information about media 
data 205 and presentation layout information are described, and 
delivers the initial metadata and change command to the media 
receiving apparatus 402. Therefore, the present embodiment 
offers another advantage of being able to reduce the amount of 
metadata . 

In addition, in accordance with this embodiment 2, the 
search unit 3414 creates media importance data 3418 
corresponding to media data 205 which are searched using a 
predetermined keyword, the media delivering apparatus 501 takes 
into consideration the communication capability 312 of the 
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network 203 and the receiving capability 313 of the media 
receiving apparatus 402, as well as the degree of media 
importance 3418 assigned to each media data, creates metadata 
506 in which address information about each selected media data 

5 205 and presentation layout information are described, and 
delivers the metadata 506 to the media receiving apparatus 402, 
and the media receiving apparatus 402 presents the media data 
205 based on the metadata 506 delivered thereto. Therefore, 
the present embodiment offers a further advantage of being able 

10 to present media data 205 associated with the predetermined 
keyword normally and to deliver media data 205 which are adapted 
to each media receiving apparatus 402. 

Industrial Applicability 

15 As mentioned above, a media delivering apparatus and a 

media receiving apparatus in accordance with the present 
invention are suitable for presenting media data 205 normally 
and delivering media data 205 which are adapted to each media 
receiving apparatus 202 by taking into consideration the 

20 communication capability 312 of a network 203 and the receiving 
capability 313 of the media receiving apparatus 202, as well 
as the degree of media importance 204 assigned to each media 
data, creating metadata 506 in which address information about 
each selected media data 205 and presentation layout 

25 information are described, and delivering the metadata 506 to 
the media receiving apparatus, and presenting the media data 
205 based on the delivered metadata 506. 



